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ABSTRACT 


In  this  paper,  the  authors  consider  the  problem  of  change  points  within 
the  framework  of  model  selection  procedures  using  information  theoretic  criteria 
The  authors  proposed  procedures  for  estimation  of  the  locations  of  change  points 
and  the  number  of  change  points.  The  strong  consistency  of  these  proceudres 
is  also  established.  Also,  the  problem  of  change  points  is  discussed  within 
the  framework  of  the  simultaneous  test  procedures. 


Keywords  and  phrases:  Change  points,  Consistency,  Edge  detection.  Information 
theoretic  criterion.  Model  selection,  Quality  control. 


1 .  INTRODUCTION 


Problems  of  detection  of  change  points  arise  in  many  areas.  For  example, 
in  continuous  production  process,  it  is  of  interest  to  find  out  the  point  at 
which  deterioration  in  the  quality  of  the  product  starts.  Change  point  problems 
arise  (see  Mazumdar,  Sinha  and  Li  (1985))  in  the  area  of  edge  detection.  For 
discussions  on  some  other  applications,  the  reader  is  referred  to  Page  (1957) 
and  Holbert  and  Broemling  (1977). 

When  the  underlying  distribution  is  normal,  the  change  point  problem  is 
equivalent  to  detection  of  change  in  mean  and/or  variance.  Chernoff  and  Zacks 
(1964),  Page  (1955),  Hinkley  (1970)  and  several  other  workers  investigated  the 
problem  of  change  point  in  the  mean.  Sen  and  Srivastava  (1973)  and  Srivastava 
and  Worsley  (1986)  investigated  the  problem  of  detection  of  the  change  in  the 
mean  vectors.  In  the  above  papers,  change  point  problems  were  studied  within 
the  framework  of  tests  of  hypotheses. 

The  object  of  this  paper  is  to  study  change  points  in  the  mean  vectors 
within  the  framework  of  model  selection  procedures  using  information  theoretic 
criteria.  In  Section  2,  we  give  some  preliminaries  which  are  needed  in  the 
sequel.  Section  3  is  devoted  to  the  problem  of  estimation  of  the  locations  of 
change  points  when  the  number  of  change  points  is  known.  The  strong  consistency 
of  the  procedure  is  also  established.  In  Section  4,  we  assume  that  the  number 
of  change  points  is  unknown  and  propose  procedures  for  the  estimation  of  the 
number  and  locations  of  change  points.  The  strong  consistency  of  these  procedures 
is  also  established.  Finally,  in  Section  5,  we  discussed  the  change  point 
problem  within  the  framework  of  simultaneous  test  procedures. 
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2.  PRELIMINARIES 


Let  x(t)  be  an  independent  p-dimensional  process  on  [0,1]  such  that 


x(t)  *  p(t)  +  v(t),  0  <  t  <  1 


(2.1) 


where  p(t)  is  a  non-random  p-dimensional  left-continuous  step  function  and 

v(t)  is  an  independent  p-variate  normal  process  with  mean  vector  0  and 

covariance  matrix  I.  Denote  the  jump  points  of  p(t)  by  t^,...*t  ;(0  <  t^ 

<  t^  <  . . •  <  t^  <  1 ) .  Then  are  called  the  change  points  of  the 

process  x(t).  We  assume  that  N  samples  are  drawn  from  the  process  x(t)  in 

1  N 

equal  space,  say  x(jj) , .  . .  ,x(j-j) .  Then,  it  is  of  interest  to  estimate  the 

1  N 

location  of  (t, . t  )  from  observations  x(rr) , . . .  ,x(t;)  .  Here  it  is 

I  q  -  N  -  N 

1  N 

emphasized  that  for  different  N,  the  series  {x(jj) . x(m)}  is  different. 

Sometimes,  the  number  q  of  change  points  is  unknown  and  in  this  case  we  have 
to  estimate  q.  In  this  paper,  it  is  assumed  a  priori  that  q  <  L,  a  constant. 

We  now  define  some  concepts  and  introduce  some  notations  which  are  useful  in 
the  sequel.  From  the  integer  interval  [0,N],  we  can  pick  £  integers  from 
[1,N-1]  to  divide  [0,N]  into  £  +  1  sections.  Let  k^  <  . . .  <  k^  (CKk.<N) 
denote  the  £  integers  picked  from  [l,N-lj.  Then,  we  call  (^.....k  )  a 
partition  of  the  interval  [ 0 , N ] ,  and  k^  1  <_  i  <_£,  a  cut  off  point  of  this 
partition.  Sometimes,  we  denote  (k^,...,k^)  by  or  it  simply.  A  partition 
(z^,...,z^)  is  called  a  refinement  partition  of  (k^,...,k^)  if  the  set 
(kj .k^, . . . ,k^}  is  a  proper  subset  of  {z^,...,z^}.  This  fact  is  written  as 

(Zj . zj  ( k  j  , . . . ,  k^  ) .  If  we  define  tt.  -  (Zj . z.),  -  (k1 . k  ), 

the  above  relation  is  written  as  tt^. 

Define  x^  ^  »  x(^j,  *  1  i  1  N.  It  is  obvious  that  xf^  is  dependent  on 
the  sample  number  N.  Throughout  this  paper,  we  will  use  x.  instead  of  x.  , 


but  we  must  keep  in  mind  that  x^  is  dependent  on  N. 

If  x  is  a  vector,  then  x*  denotes  the  transpose  of  x.  Also,  the 
maximum  and  minimum  eigenvalues  of  the  matrix  M  are  denoted  by  ^(H)  and 

Amin(M)  respectively.  Throughout  this  paper,  etc.  denote  some 

positive  constants  which  can  assume  different  values  at  each  of  their  appearances, 
and  the  Eculidean  norm  of  M  is  denoted  by  | |M| |.  Now  we  shall  prove  some  lemmas. 

At  first,  we  cite  a  lemma  by  Fuk  and  Nageav  (1971) 

Lff™1}3  2^_1.  Let  x1,...,xn  be  independent  random  variables  with  Ex.  =0, 

and  E(xilt  «  i  -  1.2 . n.  For  t  »  2,  write  Sn  -  J  ,  , 2  =  J  Varu  1. 

n  i=l  1  n  i=i  1 

n  *  I  E  |  x^  | t .  Then  for  x  >  0, 


P(Sn  »  x)  <  C<1)»t  |/t  +  exp{-C^V/B^> 


(2)2 ,„2, 


where 

CtU  *  (1+  and  Ct2)  "  2(t+2)~2e*t . 

In  the  original  paper,  C^2^  *  2(t+2^  ^e  this  is  a  printing  error. 

Lemma  2.2.  Let  x......x,_  be  iid  p*l  normal  vectors  with  mean  vector  0  and 

-1  -N 

positive  definite  covariance  matrix  Z.  Now,  let 


an  "  n  l  (vW(vW 


(2.2) 


R 


k 


* 


P<  WV  5  2  W£)>  <  <*' 


1  N  i  --  i 

Vlt  Vi  ■  5?  • 


Let 


i=l 


?i  *  (xil . V 


1  N 

yjk  *  N  l  xijxik 
i=l 


<  cn_(L+2  ^ 

(2.4 

<  CN‘a+2) 

(2.5 

ON  =  -»  -0N1  =  xr 

-NjN  =  ~2’ 

(2.6) 

<  N 

,k  <  p- 

(2.7) 

It  is  easy  to  see  that  y.,  is  the  average  of  N  iid  random  variables.  Its 

JK 

fourth  moment  is  finite.  Let  Ex^x^  *  Mj ^  and  put  t  =  4  in  the  Lemma  2.1. 
Then  for  any  e  >  0, 

?  (xijxik_vljk)l  -  <  ciN’a+2).  (2-8) 

2p 


1*1 


and 


P<  |  I xx '  I  (  >  e/2/  <  CjN' 


..-(L+2) 


l 


But 


r  .  <„  >:p«p  and  ±  J  -  (i  J  z  xu>  :p>p. 


jk 


i-1 


i-1 


It  follows 


P{  I  1  -  Ell  i  e>  <  C2N 


■  ( L+2 ) 


12.9; 


:kh  mrxr.  ww 


Then 


Let  A^(A)  >_  . . .  >_  Ap(,A)  denote  the  eigenvalues  of  a  pxp  matrix  A  in  the  sequel 
Then  a  well-known  inequality  about  symmetric  matrix,  we  have 


I  (Ai(AN)-A.(E))2  <  tr(A^-Z)2 
i-1 


(2.10) 


The  relations  of  (2.4)  and  (2.5)  follow  immediately  from  (2.9)  and  (2.10). 
Lemma  2.3.  Let  x^,...,x^  be  iid  p*l  normal  vectors,  (again  take  notice 
of  that  x^,  1  <  i  <  N,  is  dependent  on  N),  x^  -  N  (0,1),  Z  >  0  and  is  unknown. 


Write 


\  “  N  ^  ^ -i”X0N^ -i~-0N^  ’ 


1  1  -  i  » 

®N  *  N  ^  (-i"-0N, )(-i'-0N, )  +  N  .  |  , /-i"XN  V^i"XN  V 


(2.11) 


i=N1+l 


where 


X.  .  =  T^T  V  X. 

-ij  J-i  ,  f,  - 


k=i+l 


Then  we  have  that  A^-B^  is  non-negative  definite,  and  for  k^,  k^,  0  <  <  k^  N, 

the  following  estimates  hold. 


rk2_kl  - 


(L+2) 


P{Amax(  N  Xk1k2i5k1k2)  -  C  N  }  <  C1N 


(2.12) 


P{  |  (loglAj^l -log|BN| )  >  C  logN}  <  Cj  N'^+2)  (2.13) 


Proof.  Without  loss  of  generality,  we  prove  (2.12)  for  k^  *  0,  k2  *  N  only. 
Let  y(N)  *  Z_ix„M.  Then  y(N)  -  N_(0,I_).  Write  y(N)  -  (y{N) . y^10)'. 


and  yv"V  '  *  lyjj  >:p*P.  where  Since  y ~  N(0,1),  for 

all  i,  1  <_  i  <_  p  it  is  not  difficult  to  see  that 

P(|y^N)|  >  Clog^N)  <  -z~ - r-  e  2  8  <  C  N‘(L+2) 

1  /2ir  ClogJN 


for  some  constant  C,  and  N  >  3. 


2  2 

From  the  fact  that  2|y^y^|  <  y^  +  y^,  it  follows  immediately  that 

p{||y(N)y(N)  ’ll  >  c  logN}.  <  c1N_a+2) 


-CL+2) 


PIMNxon*onII  >CLogN'}  "  C1N  , 


which  implies  (2.12). 


Here,  we  note  that 


1  N  _  . 

*N  =  N  l  ^ -i'-ON^ -i”-0N^ 


N^N-N^ 

BN  +  ^2  (?0N.  ’^N.N^-ON.'^N  V 

N  11  11 


and  for  any  vectors  x  and  y 


i  xx1-  yy'  <  (x-y)(x-y)'  <_  2(xx  +yy') 


(2.14) 


we  have 


2N1N2 

0  -  *N  ‘  BN  -  n2  ^Of^-ON^  ?N1N?N1N)' 


From  (2.12),  we  get 


WWW  V<L+2) 


It  is  well-known  that  for  any  symmetric  matrix  D2  and  >  0, 


A  (D-)  ,  ,  A  (D~ ) 

■“  2  <  x  (d;W)  <  "ax  L 


A  (D,  )  -  max '’  l  U2U1  ’  -  A  .  (D,  ) 
max  1  mm  1 


By  Lemma  2.2,  we  have 


P{ A  .  (Bxt)  <  \  A  .  (2)}  <  .C.N- 
^  mm  N  —  2  min  !  .1 


~(L+2) 


From  (2.15)  -  (2.17),  it  follows  that 


P{6.  >  C  logNVN}  <  ClN 


■  ( L+2 ) 


where  6.  =  A  .  (.  B  (A  -B.JB  }  ) . 
1  1  N  N  N  N 


From 


|(log|ANi-log|BNi)  =  f  log|(I+B‘i(AN-BN)B‘i)| 


N  .  N  P 


=  §  log (  n  (1+6.))  [5. 

’  i=l  L  i=l 


(2.15) 


(2.16) 


(2.17) 


(2.13)  follows. 

Remark  2.1.  Let  ir^  =  (k^,...,k^)  be  a  partition  of  [0,N],  where  l  is  less 


than  a  certain  constant  L.  Set, 

i  Vi 


V<N)  =  s  l  I  <vVk..,K5i'Vk.  y-  <2-18) 


j=0  i-k.+l 
•  J 


v.sv...  — a.  —  IV  ,  r*  .  .  . 

J  3+1  J  J+l 


where  k  -  0,  k».,  *  N.  Then 

o  t+1 


P{  A  .  (A  (N))  <  \  A  .  (2)}  <  C.N'(  . 
v  mm  1 to  -  2  mm  1 


-(L+2) 


(2.19) 
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To  see  that  (2.19)  hold  true,  we  observe  that 

l 


AN  "  Arr/N^  N  ^  (kj+rkj^-k.k .  ~-0N^-k.k  '?0N) 

<■  j=0  J  J+1  J  J+1 


l  l 

<  2  I (kj+rkj_  -  + 1  y  (k  -k  )x  x' 

N  -k .k ... -k.k .  .  N  V  j+1  j  -ON-ON 

-  j+1  j  j+1  j=0 


(k<+rM  . 

j=0  j 

i 

K,j_,  -K , 

i 

N  -k.k. -k.k. 


„  k ... -k . 

-2  l 

j-0  J  J+1  J  J+1 


-ON-ON 


From  (2.12),  we  have 


P{A  (Av-A  (N))  >  C  }  <  C.N 

1  max  S  i,  -  N  1 


'(1r+2)  (2.20) 


Write  Ai(AN'AiriN))  "  Ai  simPly  write  ^  (N)  =  (a.  .):p*p 


By  means  of  inequalitites 


X,  >  max  a. . 
1  "  l£ilP  11 


X .  X~  >  max  (a. .a.,  -  a..), 
1  2  ‘  iii«j«p  11  11 


we  have  for  any  e  >  0, 


P(I|AN  -  A^(N)(|-  >  e}  <  C1N_(^+2) 


(2.21) 


Combining  (2.9)  and  (2.20),  we  have 


p{  I  |A  (N)  -  Z  ||>  e}  <  CXN 


•(L+2) 


(2.22) 


Thus,  we  obtain  (.2.19). 

Remark  2.2.  Let  x^,...,x^  be  independent  p*l  normal  vectors  with  common 


covariance  matrix  E  >  0.  Assume  that 


E?i  *  •••  =  E^k1  =  Hi*  E?k1+1  *  •••  =  E*k2  *  h2* 


Exi.  E(xL  >  -  u..  Ex,.  .  ,  »  ...»  Ex..  *  u  „ ,  ,  . 


N  .  n  .  ,  ,  -i  ~k.k.+l  -i  ~k.k.,, 

j=0  i-k  +1  33  33+1 


for  it  ^  =  (k^f...,k^)f  we  have  for  z  >  0 

P{ | | (N)-Z | |>e}  <  C1N"(^+2) 


(2.23) 


The  proof  is  very  easy  if  we  notice  that  x^-y^ , . . . ,x^  -y^ ,  x^  ^^^-y^,.. 
x^  +^-y^+1  are  iid  pxl  normal  vectors  so  that  we  can  apply  Remark  2.1. 


Lemma  2.4.  Let  x  ,...,xN  be  independent  pxl  normal  vectors  with  the  same 
covariance  matrix  Z  ^  0,  and  tr^  =  <k^  ,...,k^)  be  a  partition  of  [0,N].  Assume 


Vi<liV 

where  j  =  1,2,.../+1,  and  kQ  =  0,  k£+j  =  N.  Set 

k.+N_ 

1  J 1  2  __  _ 

AjjCkj  ,NX  ,N2)  =  jj  l  (W-N  ,k.+N  )(V-k  -N^k  +N. 


(2.24) 


)’  (2.25) 


i=k.-N,+l 
3  1 


3N(kj  *Ni *N2)  n  l 

J  i=k . -N, 


(X.-X.  1.  -M  W 

+1  _1  'kj  Nl,kj  -1  3  1’  j 


k.«2 

+  N  1  ^i-ik.,k.+N,)(V5k.,k.+N  ’’ 

i=kj+l  3  3  2  3  3  2 


(2.26) 


where  <  k^  -  an<^  £  ^j  +  l  ”  •  ^et  11  ^+1  =  ^kl  ’  ‘  *  *  ,kj -1  ,kj  i  ’ 

kj+N2’  kj+l ’ ’ * ‘ ,k£^  3nd  *1+2  =  (k! . kj-1’kj'Nl’kj*kj+N2’kj+l‘---*k£J- 


Suppose  that  for  some  positive  constant  8, 


N1^°N’  N2  i  SDN 

(2.27) 

where 

D^  satisfies  the  following  conditions: 

(i)  lim  D ../  logN  =  « 

..  N 

N->oo 

(2.28) 

1 

(  ii)  lim  D../  S  =  0. 

N-  N 

(2.29) 

\ 

% 

Then 

there  exist  constants  C^  and  C 2  such  that 

1 

* 

m 

P(  Amax<AN<kj  ,N1  ■l,2)"Vkj  ’“l  •N2)  >  ‘  C1  "S  >  ‘  C3N_<'t2' 

(2.30) 

1 

Si 

and 

P(  2  (l08,\+1,'1°8f\+2[)  <  C2DN}  <  C3N"(L’2) 

(2.31) 

\ 

■S 

’i 

where 

A  ,  A  are  defined  by  (2.18)-. 

vi+l  vl+2 

% 

* 

1 

Proof 

.  By  calculation  and  the  inequality  (2.14),  we  have 

V 

A  -  A  =  AN<kj,N  ,N2)  -  BN(kj ,Nl ,N2) 

£+1  1+2  J  J 

1 

* 

N1N2 

~  N(N,+N-)  (-k.-N.,k.  -k.,k.+N_)(-k.-N.,k. 

12  J  1  J  1.  J  2  J  r  j 

X  )  * 

~k,,k.+N  ' 

J  J  2 

% 

N1N2  1 

-  N(N1+N2)  (2(-j+l  }( -j+1  -j}  2(-kj-M1,k 

^KVi'k 

» 

2(~k.,k.+N  '^j+l^-k. ,k.+N  "Hj+1*1 ) 

J  J  2  J  J  2 

»* 

=  (Jj-J^  J3}- 

(2.32) 

i 

w 

c 

From 

Lemma  2.3,  Eq:(2.29)  and  A  (J. )  >  —  tr(J,),  we  get 

max  1  -  p  1 

f 

i 

P{ A  U  +J  )  >  C  ^  }  <  CN'(L!2) 

max  2  3-  N  I 

(2.33) 

■ 

>" 

r 

i 

w 

i* 

By 

N  N  ? 

Xmax(JP  -  2p  N(N1+N2 )  ^(M2j  Mlj  ^  -  2ciDn/N’ 

12.34) 

Combine  (2.33)  and  (2.34),  the  inequality  (2.30)  follows.  By  (2.19), 


-(L+2) 


p{A  (A  )  >  f  A  (Z)}  <  C.N'^T 
max  it£+2  2  max  1 


(2.35) 


p  { A  .  (A  )  <  4  A  .  (Z)}  <  C,N'(L+2) 

mm  v£+2  2  mm  1 


By  (2.14),  (2.33)  and  expression  of  A 


Yh  V 1+2 


P(WAVl-A.aa>  i  2H“j+1-“j,|2}  <  Cl»'iL'2>  (2.36) 

Mote  that  i  there 


£+1  £+2 


exists  C  >  0  such  that 
4 


P{Xmav(ATi  (Air  >  CA>  <  CiN’  . 

max  tt£+2  ir£+1  +  £+2  -  4  1 


(L+2) 


(2.37) 


rf  Amax(Air0  })  *  c4>  there  exists  C  >  0  such  that 


e+2  t-t-i  "i+2 


I  ( log  |  A  |-log|A  |)  *  J  log  I  I  +  A  (A  -A  )A"*  )| 

e+1  e+2  2  v i+2  *e+ 1  "e+2  vi+  2 


>  f  log(l+A  [A'1  U  -A  ) J ) 

2  max  e+2  *e+i  11  e+2 


>  T  C.  A  [A  -A  )/X  (A  ) 
-  2  5  maa  t£+1  .{+2’  *£+2' 


(2.38) 


p{ 2^ log  I  A  | - log | A  | )  <  C7Dw} 

e+i  vz+2  2  N 


A  (A  -A  ) 
max  tt„  ,  ,  it  „ 


a  p{ a  (a'1  (a  -a  )]  >  c  }  +  p{-  r  _ e+i  e+2  f  . 

max  tt„  n»  .  .  tt„  .  1  -  ''V  rb  77  7  <  C-A,l 


3.  ESTIMATION  OF  CHANGE  POINTS  WHEN  q  IS  KNOWN 


Let  0  <  t ^  <  . . .  <  t  be  the  change  points  of  the  process  x(t).  We  can 
(N)  (N)  (N) 

find  a  partition  tt„  =  (k.  )  for  each  N,  such  that 

U  1  q 


Ex(^)  =  if  kj^|  <  i  <  kjN\  1  <_  j  <_  q+1 , 


(3.1) 


(N)  (N) 

where  k„  =  0,  k  .  =  N,  and  y.  ,  t  y.,  1  <  j  <  q+1 
0  q+1  -J-l  -J  ~  ~ 


In  order  to  simplify  notation,  we  write  x_.  for  x(^),  vq  for  and 


k.  for  k'.N). 
J  J 


For  the  integer  interval  [0,N),  there  exist  (  )  different  integer 

partitions  denoted  bv  K  .  Assume  that  it  =  ( k!  , .  .  .  , k 1  )GK  ,  and  h  is  a  hypothesis 

"  n  n  n  tr  -  r 


q  q 


such  that 


Ex  .  =  y‘.  if  k ,  <  i  <  k ,  1  <  j  <  q+1,  k'  =  0,  k'  =  N. 
-i  -J  J-l  -  J  ~  ~  0  q+1 


The  model  M  is  the  one  for  which  h  is  true.  Let  0  =  {(tr,  Z )  ;tt6K  ,  £  >  0}  be 
it  it  q 

a  parametric  space.  We  are  interested  in  selecting  one  model  based  upon 

observations  x, , . . . ,x„.  Under  h  ,  the  logarithm  of  the  likelihood  function  is 
-i  -N  IT 


i  qV  ,  -i 

A  \  /  r*  i 


logL(9)  =  -  y  log  j  Z  (  -  j  i  tr(  I!  A 

j-l  J 


where 


A  (N)  =  -  I  ,  U-x'  ; 

i=k  .  .+1  J-l  J 

J-l 


)(w  1k:r’ 

j-i  j 


j  =  l,2,...,q+l,  k'  =  0,  k'  =  N,  and  0  =  (tt,Z).  Let 
o  ■  q+1 


A  (N)  =  A  .  (N) . 
*  j-l  J 


It  is  not  difficult  to  see  that 


Sup  logL( 9 )  =  -  ~  log|A  (N)|  - 


(3.2) 


(3.3) 


(3. A) 


(3.5) 


where  (H.  denotes  the  parametric  space  under  h 


and  | A |  denote  the  determinant  of  the  matrix  of  the  matrix  A.  Now  let 


Gn  -  *  f  logjA^N)! 


(3.6) 


where  »  £  K  .  Then  there  exists  a  partition  ir  =  (k ,  >  •  •  •  .k  )  6  /r  such  that 


G-  ■  maxG 
*  * 


(3.7) 


kL  k  M 

Using  (—  ,  as  an  estimate  of  the  true  location  of  (t.,...,t  ), 

N  N  1  q 

we  can  prove  the  following  theorem: 

Theorem  3.1.  Let  x^ . x^  be  a  sample  of  size  N  drawn  from  the  process 

k^  k 

x(t)  in  equal  space,  where  x(t)  is  defined  by  (2.1)  Then  (— . -$•  )  is 

-  -  N  N 

a  strongly  consistent  estimate  of  (t^,...,t  ). 

Proof.  For  each  N,  let  if  =  (k,,...,k  )  be  a  partition  so  that  (3.1)  holds. 

o  1  q 

It  is  easy  to  see  that 

ki  1 

Itf  t.|  <  j  ■  1,2 . q  (3.8) 


Now,  take  constants  which  satisfy  (2.28)  and  (2.29).  Then  the  adjacent 

intervals  <k.  -  D..,k.  +  D..)  and  (k.^,  -  D..,  k_.  +  D..)  are  non-intersecting 

j  N  j  N  j+1  N  j  +  1  N  6 

-(N) 

for  large  N  and  all  j  *  1,2, ... ,q.  Define  r  *  {n 1  -  ( k! , . . . ,k ' ) :  3  j  <  q 

q  i  q  J  ^ 

such  that  {k|,...,k^}  0  {k^-D^,  k^+D^}  =  $}.  We  shall  prove  that  with  prob¬ 
ability  one  for  large  N, 


G  ,  <  G  for  all  u'6  K(N). 

TT  IT  Q 

O  M 


Take  a  partition  it1  *  ikj  , .  . .  ,k '  )£  K  »  we  can  construct  a  new  partition  tt. 

1  q  q  1 

which  has  all  cut-points  of  both  v  and  it'  except  the  point  k  .  Denote  n.  bv 

o  j  1 

(kj, . . .  ,k£) .  Assume  k^  drops  into  (.k1^  ,k')+i  J .  Let  =  (k” . kV.k  ,kV+1 , .  .  . 

From  Lemma  3. A,  with  probability  for  large  N  we  have 


P{%  •  Gn1  i  W  <  CrN 


- ( L+2 ) 


(3.9) 


Since  Ti  *  ( k  7 ,  . .  .  ,K')  is  a  refinement  partition  of  it',  it  is  easily  proved  that 


15 


(Aii,-Air  )  is  a  non-negative  definite  matrix  based  on  the  expressions  of  A^ 


and  A  ' .  So 

it 


G  ,  <  G  . 

IT  '  —  IT 


(3.10) 


At  the  same  time,  for  tt^— <  it^,  we  can  sequentially  construct  rr^  *  (kj . k'.', 

k . ,k" . , . . . ,k”)  from  (k,,...,k  )  =  tt  by  means  of  adding  one  cut-point  of  ir'  each 

ji+1  lqo 

time.  Let  tt^+^  denote  a  partition  which  has  all  cut-points  of  and  one 
cut-point  of  it'  different  from  k^,...,k  .  ^q+2  a  Part^ti°n  which  has  all 

cut-points  of  and  one  cut-point  of  it'  different  from  all  cut-points  of 

it  .  Thus  we  obtain 

q+i 


TT  =  TT  >-TT  L  IT  .  1T„  ,  ,  *  1T_  . 

o  q  q+1  ^  q+2  ~  ~  l  +1  2 


By  Lemma  2 . 3 


i  -  G  >  C  logN }  <  C  N'(L+2),  for  j  =  0,1 . f-q,  a.s. 

<l+j  +  l  q+j 


Hence  tt  >—  tt ^  implies 


PIG  -G  >  LClogN)  <  LC, M 

'  IT  TT  -  1 

2  o 


•(  L+2 ) 


(3. 11) 


Combining  (3.9)  -  (3.11)  and  noticing 


G  ,  -  G  =  (G  ,-G  )  +  (G  -G  )  +  (G  -G  ), 

Ti  tt0  it  ^  v2  tt2  ttq 


we  have  for  large  N 


>  -  -  Dn}  <  (L+DCjN 


-U+2) 


(3.12) 


Note  that  the  constatns  and  are  independent  of  the  choice  of 


n'c  Since  <K K ^)  <  NL»  we  have 

q  q 


-( n)  . 

j  ,-G  >  0  for  at  least  one  n'eK  }  <  C. (q+1. 

TT  1  TT  —  q  i 


Bv  Borel-Cantelli ' s  lemma,  with  probability  one  for  large  N, 


it  *  (kL . kq)  t  , 


which  implies  that 


i-i  -  t  I  < 

1  N  j1  N  ’ 


Thus  the  theorem  is  proved. 


6 


4.  ESTIMATES  OF  THE  NUMBER  AND  LOCATIONS  OF 
CHANGE  POINTS  WHEN  q  IS  UNKNOWN 


When  the  number  of  the  change  points  is  unknown,  we  have  to  estimate  q. 

For  this  purpose,  some  notations  are  introduced.  Suppose  that  *  (k^\...,k^^) 

is  an  integer  partition  of  [0,N].  Set 


K  «  {(k|N),...,k<N)),  0  <  l  <  L,  0  <  k^- 


■  <  k(£N)<  N}  (4.1) 


( N )  ( N ) 

where  f  ^  L  and  L  is  a  constant.  ,  k^  will  be  written  as  rr^.k^  respectively  below 

Let  the  sequences  C^  and  satisfy  the  following  conditions: 


^N  ^N  ^n 

TSiN  ■  "  cT  *  TT  ■  °' 

N-*°°  B  N-**  N  N-*°° 


(4.2) 


^  ’  2  108  (\(N)|' 


(4.3) 


.  -(N)  ,-(N)  -  ( N )  , 

Assume  *  =  (k^  ,...,k  ^j)  6  K  is  a  partition  of  maximizing  when 

t  runs  over  all  integers  of  ( 0 , L ] ,  i.e., 

H(£(1N,.....k(.X))  «  max  H^N).  (4.4) 

h  ».€C  ^ 

tw  ^N)  :(n)  J  < 

men,  Kj  . K-(N)  are  8rouPed  int0  some  groups  by  the  following  procedure: 

h 

Let  k{N)  belong  to  the  first  group,  say  mJN).  If  k<N)  -  k<N)  <  n  k^N)  also 

A  L  1  N  2 

belongs  to  ;  otherwise  is  called  an  element  of  the  second  group  . . . 

*(N )  *" 

In  general,  assume  k\  e  M.  .  Then 

t  l 


Tm(N)  if  k(N)  J(N)  n 

u(N)  \  Mi  *  lf  k£+l  '  k£  c  Dn 

£+1  i 

,  otherwise. 


(4.5) 


Based  on  this  procedure,  we  get  groups  . M-N^  finally.  Let 

-  (N)  (N)  _  ^N 

Ki  .  an  element  of  M  ,  j  »  1 , 2 , . . . ,q  .  Then  we  have  the  following  theorem: 


Theorem  4.1.  Under  h  .  we  have 

-  ^ 

o 


(i)  q. 


(ii)  (- 


r(N) 

^(N) 


a. s . 


(4.6) 


(4.7) 


*(N)  (N) 

Proof.  In  order  to  simplify  notation,  we  write  k.  for  k.  »  for  it'  ,  etc. 

J  J  I  I 

If  the  hypothesis  h  is  true,  then  there  exists  a  partition  it  =  (k.,...,k  ) 

»  o  1  q 

o 

so  that  (3.1)  holds.  From  (3.11),  for  some  C  >  0, 

I  P{G  ,  -  G  >  CLlogN)  <  » 

IT  '  G  K.  IT  <  TT  £  nO 

o 

Then,  with  probability  one  for  large  N,  we  have  for  any  l  <  L  and  ir£  *  (j  . )  u 

-  K,  ■  -  G,  -  iCN  +  "CN  ■  0<lo*N)  ■  «-*>cN 

■LOCO 


*  '  2  CN  <  °- 


(4.8) 


This  fact  shows  that  the  maximum  of  H  cannot,  reach  at  refinement  parti- 

1  3  * 


tions  of  nQ.  Next,  set  *  D^C^.  follows  that 


Def ine 


R*,  C  Rj 

lim  jr-  ■  0,  lim  p-  ■  0,  and  lim  *  0. 

N-~  N  N-*—  N—  NLN 


k' )€K:  ^ j  <  q  such  that  |k!-k  |  >  R  for 
it  —  1  j  N 


(4.9) 


Then,  following  the  same  lines  of  obtaining  (3.12),  we  have, 
with  probability  one  for  large  N, 

„rc;(N)  P{V  -  G*  -  •  r  V  "  • 


(4.10) 


U  WWW  WWW  UUUWW  U  W  V.  V  HF  WW  V  k»  <w 
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-  (  M  ) 

and  with  probability  one  for  large  N,  we  have  for  any  tt '  =  (k.1  , . . .  ,k»)6K  , 

C  1  ^ 

K;  -  K  -  V  -  c„  -  «-»)cn  ‘  -  r  **  +  I£-’Icn  ‘  -  r  %  *  °-  (4-ll) 

l  o  l  o 

This  inequality  also  shows  that  the  maximum  of  cannot  reach  at  it1  above. 

In  other  words,  if  it  =  (k,,...,k-)  is  a  partition  of  these  maximizing  H  ,  ir  e  K  ,  then 

1  h  t r 

at  least  one  cut-point  of  it  falls  into  each  R^-neighbor  of  k^  ,  1  <  j  <  q, 
with  probability  one  for  large  N.  Hepce,  based  on  our  procedure  of  grouping 

k^,...,kj-,  it  is  hnown  that,  with  probability  one  for  large  N, 

qN>q.  (4.12) 

Further,  we  prove  that  all  cut-points  of  it  =  (.  k  , ...,k-)  must  drop  into 
«  DN  °N 

u  (k.  -  — r,  k.  +  — ),  with  probabilitv  one  for  large  N. 

,  _ ,  ]  2  i  2  *  6 


First,  suppose  tt^  =  (kj,...,k^)  satisfies  the  following  condition: 
- 1  cir  p(  N ) 

l  fc  "  *  '  nl  VQ  and  for  some  i  <  l, 

( *  ) 

q 

k'  e  U  (km‘RN»km+RN) »  b  ~  1,2, . . . , i-l, i+l, . . . ,  l 

b  m=l 


and 


k!g  u  (k  -R  /2,k  +R  /2). 
i  .  m  N  m  N 

m=  1 


Wr 


ite  11  £-1  ~  » •  •  •  »k[_i  >  •  •  •  ,kp  •  is  obvious  that  tt^  tt'^.  Sinct 


A  »  *  A  ■  *  m  (k  -k‘  ,Xx,  ,  ,  ,-x,  ,  .  ,  )(x,  ,  ,  ,-x,  ,  ,  , 

w  o  ,  *  p  N  i  i-l  -k!  k!  -k!  k'  ,  -k!  ,k!  -k'  ,k  . 

<-  1  c  1-1  1  l-I  1+1  1-1  1  1-1  i+l 


)' 


+  N  (lti+rki)(^k,k!  " -k  !  k!  .,)(-k!k!,  ‘-k*  ,  k '  ,  } '  -°  (4'13) 

i  i+l  l-l  i+l  ii+I  i-l  i+l 

Assume  that  k^_^  <  k^_^  <  <  k!^  <  k^  (see  figure  i)(,For  other  permutations  of 

k^- i , k | , k^+i , k  _ i  and  k  ,  the  treatment  is  similar.) 


-  -  N  -r  N /  2  -■ 


-°N/2 


,-rn 


k*  k.  , 

i-l  ; - l 


k' 


k' 


i+l 


Figure  1 


Set 


ki-l  -  ki-l  n  , 
a  =  P  =  1  "  a- 
1  1-1 

Making  routine  calculation  and  noticing  (2.14),  we  can  change  (4.13)  into  the 


form. 


(ki‘ki-i)(ki+rkl)  - 

~  i  N(k'  -k'  )  ^aXk'  ir  +^Xk'  k  ~Xk' k' 

l- 1  l  N  l+l  ki-r  ~Ki-lkj  -1  "j-lkj  kiki+l 


'  <“*!  ,k.  ,k!'Vk’  >' 

1-1  J-l  j-1  1  1  1+1 

2(k!-k:.l)(k;+1-k:) 

1  Vf^rMTii - “ 


I6(k! -k!  ,)(k'  ,-k!)  0 

.  i  i-I  l+l  i  ,  2,~  w- 

N ( k !  - k !  )  [a  -k!  k.  . "-j-l  -k!  k.  . "-j-l) 

l+l  i-I  i-I  j-l  J  i-I  j-l 


+  g2(x 


-k  k,_Mi^-k  k 1  ”  -  i  ^  *  +  ^xk  k’  ’-i^-k'k'  *-i 

j-l  i  "  j-l  i  li+l  i  l+l 


=  h  +  I2 


(4.14) 


But  by  Lemma  4.3,  we  get  uniformly  for  all  those  ir^  satisfying  (*), 

A  U7)  =  0(^^)  =  o(^)  a .  s .  (4.15) 

max  2  N  m 


(k .  -k!  , )2( k!  -k! ) 

A  (I  )  =  0(— J ^  ■  1 - _ 1— )  _  -1 — ) 

maxlV  UlN(k!-k'  ,)(k’  ,-k!  ,)J 

l  i-I  l+l  i-I 


(4.16) 


ft 


Since  kj_j-kJ^<  R^,  k^  -  k  ^  >  — ■  from  (4.2),  for  large  N,  k!^  -  k^_j  = 

°N 

(k^-lr  ^)  +  (kj_^-k|_^)  >  — ,  we  uniformly  for  all  those  n ^  satisfying  (M)f 


r2  c 

\nax(Il}  =  0(^:)  *  o(^)*  a-s- 

N 


(4.17) 


V 

y 

i* 


X  (A  -A  ')  <  X  (I.)  +  X  (I.)  =  o(rr-),  a.s. 
max  it  —  max  1  max  2  N 


(A. 18) 


From  this  inequality  and  Remark  2.2,  with  probability  one  for  large  N,  we  have  for 
all  those  irj>  satisfying  (*), 


H  >>  -  H 

1  " 


’  (f-‘)CN+tCN 


'  °<V  *  CN  '  2  CN  >  0 


(4.19) 


*  ( N  ) 

Next,  set  V  e  K  ~  K  .  Assume  that  it'  V  n  and  there  exist  more  th 

i  l  ° 


one  cut-points  of  n'  in  the 

t 


complementar 


v  set  of  U  (k.-  — k .+  — .  Writ 
j*I  1  2  2  2 


-  •  s  fir '  W'  If*  k'  k'l 

71  £  V  K.J  »  •  •  •  *  K  •  >  “ii! . •‘ij.  j.l . 


i’  i+1 


i+ur+1 ' 


and  suppose  that  there  exists  a  cut-point,  k^  say,  of  such  that 

°N  °N 

ki  e(kj-r  r*kj-i+r  > 


k.  e(k  .  DJi  k  +  Ha  ) 

i+w+1  ZK  i  2*  j  2  ’ 


k  • 

i+r 


, k'  c  [k  +  ~,k  - 
i+u>  j  - 1  2  j  2 


*.  <  °N 


<  dn  -k  <  dn 


k 1  k '  k '  k ' 

i+1  i+2  i+ui-1  i+uj 


k  1 

i+ur+1 


l  ~ur+m+l 


Figure  2 


k  1  k ' 

*  i*  i+i" 


k  '  k  '  k  ' 
’l+m’i+uj  i+ur+1  ’ 


•  > Rp  )  m  =  0  •  1  •  •  •  •  iuj-  1 . 
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£  -u)  1 


(k'**--*k-»  ki+urH’ •  •  •  .k'*)- 


From  (4.18),  we  get  uniformly  for  all  those  and  tr^  -(JJ+mat>ove  ’ 

CN 

A  (A  -  A  )  =  o(-r:),  a.s. 

max  it  *  tt  <  N 

Z-Ui  £-(jJ-t"l 

Following  the  same  lines  as  obtaining  (4.15),  we  have  uniformly  for  all  those 

it'  above,  r 

£-urhn 

A  (A  ,  -A  ,  )  =  o(-t:)  ,  a.s. 

***  'i-m’i-** i  s 

Again  notice  that  Ex.  1  -  u.,  r  »  i  +  1 . i  +  u>,  hence  hy  Lemma  2.3.  »'  have 

“Kr  J 

uniformily  for  it'  and  n'  , 

-C-U)  -C 

OJ-l 

A  (A  ,  -A  ,  )  <  l  A  (A  ,  -A  ,  ) 

maX  V  ~  mt0  maX  Vurfm  Vu*nH-l 

<  o(i)  +  («-2)  O(^)  =  o(^),  a.s. 

From  this  fact  and  the  Remark  2,2  of  Lemma  2.3,  it. is  proved  that  with  probability 

one  for  large  N,  we  have  for  all  those  tt  1  above,  there  exists  tt  „  such  that 

l  £-w 

-  H,  =  -  |  Uog|A  |-log|A  |)  -  <i-h)CN  +tCN 
t-u)  C  t-oj  t 


°lCN>  +  hCN  >  i  CN  >  0. 


(4.20) 


The  inequalities  (4.19)  and  (4.20)  show  that  the  maximum  of  H^,  n  t  K  , 

can  not  reach  at  partitions  for  which  there  exists  one  cut-point  at  least  droping 

out  of  U  (k.-  — 2>k.+  —  ).  According  to  our  procedure  of  grouping  k^,...,k^, 
j=l  J  J 

with  probability  one  for  large  N,  no  two  or  more  groups  exist  in  each 
— 2  -neighbor  of  k^ .  Hence 


q  <  q  a.s. 


(4.21) 


Combining  (.4.12,1  and  (4.211  we  have,  with  probability  one  for  large  N, 


qN  =  q- 


v  / 
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5.  TESTS  OF  HYPOTHESES  FOR  DETECTION  OF  CHANGE  POINTS 


F- 


First,  we  assume  that  Z  is  known  and  the  number  of  change  points  is 
unknown.  Let  tu  =  p  ^  to r  i  =  1,  2,...,  N-l.  Then  we  can  test  (see 
Krishnaiah  (1969))  the  hypotheses  simultaneously  as  follows.  Let 

) 


for  i  =  1 , 2 , . . . , (N-l ) .  Then,  we  accept  or  reject  H^  according  as 

2  <r 

X,  >  c 

i  >  a 


where 


2  N-l 

P[ y  1  c  ;  i=l ,2, . . . ,N-1 |  n  H, ]  =  (1-a). 
i  a  i»!  i 

2  2 

The  joint  distribution  of  is  a  multivariate  chi-square  distribution. 

Tables  for  approximate  percentage  points  of  c^  are  given  in  Krishnaiah  (1980). 

If  £  is  unknown  but  an  independent  estimate  (S/vj  of  £  is  available,  we  use  T~ 
instead  of  as  test  statistics  where 

T.  =  (x.-x  )‘(S/v)  Hx.-x 

l  -l  -i+l  -l  -i+l 


We  can  determine  the  number  of  change  points  and  estimate  the  locations  of 

N-l 

change  points  by  the  above  method.  Here,  we  note  that  n  H.  indicates  that 

i=l  1 

=  ...  =  and  no  change  points  exist.  If  are  all  simultaneously 

rejected,  then  we  have  (N-LJ  change  points.  If  q  of  the  hypothesis  H . ( i*l , . . . , 

i 

N- i ;  are  rejected,  then  there  are  q  change  points.  Suppose  0  H .  is  accepted 

j=l  J 

and  is  rejected,  a  change  point  occurs  at  t^.  Suppose  q  is  known,  then 

t£  ,...,t£  are  change  points  where  they  are  chosen  as  follows: 

1  q 


=  max{xi;  i=l,...,N'l;  i  t  k^k^ 


2  2  - 
X£  =  max{xi;  i=l . N-l;  i  ^  k^-.-.k  }. 


We  can  use  finite  intersection  tests  proposed  by  Krishnaiah  (1965)  for 
multiple  comparisons  of  mean  vectors  also  to  estimate  the  locations  of  change 
points . 
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